Vit Large Patch16 224.orig In21k
Apache-2.0
A Vision Transformer (ViT) based image classification model, pretrained on ImageNet-21k by Google Research using JAX framework and later ported to PyTorch. Suitable for feature extraction and fine-tuning scenarios.
Image Classification
Transformers